Automatic model selection for partially linear models

نویسندگان

  • Xiao Ni
  • Hao Helen Zhang
  • Daowen Zhang
چکیده

We propose and study a unified procedure for variable selection in partially linear models. A new type of double-penalized least squares is formulated, using the smoothing spline to estimate the nonparametric part and applying a shrinkage penalty on parametric components to achieve model parsimony. Theoretically we show that, with proper choices of the smoothing and regularization parameters, the proposed procedure can be as efficient as the oracle estimator (Fan and Li, 2001). We also study the asymptotic properties of the estimator when the number of parametric effects diverges with the sample size. Frequentist and Bayesian estimates of the covariance and confidence intervals are derived for the estimators. One great advantage of this procedure is its linear mixed model (LMM) representation, which greatly facilitates its implementation by using standard statistical software. Furthermore, the LMM framework enables one to treat the smoothing parameter as a variance component and hence conveniently estimate it together with other regression coefficients. Extensive numerical studies are conducted to demonstrate the effective performance of the proposed procedure.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Kernel Ridge Estimator for the Partially Linear Model under Right-Censored Data

Objective: This paper aims to introduce a modified kernel-type ridge estimator for partially linear models under randomly-right censored data. Such models include two main issues that need to be solved: multi-collinearity and censorship. To address these issues, we improved the kernel estimator based on synthetic data transformation and kNN imputation techniques. The key idea of this paper is t...

متن کامل

Comparison of Linear and Threshold Models for Estimation Genetic and Phenotypic Parameters of Success of Conception at First Service and Inseminations to Conception in Holstein Cattles in East Azarbayjan Province

In this research genetic and phenotypic parameters were estimated using linear and threshold models, for reproductive traits, data from 6 large industrial dairy herd of East Azerbaijan province collected by Agriculture Jihad Organization during 10 years (2001-2010). Best linear unbiased predictions of traits breeding values were estimated using Restricted Maximum Likelihood method by WOMBAT sof...

متن کامل

Comparison of Linear and Threshold Models for Estimation Genetic and Phenotypic Parameters of Success of Conception at First Service and Inseminations to Conception in Holstein Cattles in East Azarbayjan Province

In this research genetic and phenotypic parameters were estimated using linear and threshold models, for reproductive traits, data from 6 large industrial dairy herd of East Azerbaijan province collected by Agriculture Jihad Organization during 10 years (2001-2010). Best linear unbiased predictions of traits breeding values were estimated using Restricted Maximum Likelihood method by WOMBAT sof...

متن کامل

Linear or Nonlinear? Automatic Structure Discovery for Partially Linear Models.

Partially linear models provide a useful class of tools for modeling complex data by naturally incorporating a combination of linear and nonlinear effects within one framework. One key question in partially linear models is the choice of model structure, that is, how to decide which covariates are linear and which are nonlinear. This is a fundamental, yet largely unsolved problem for partially ...

متن کامل

A Comprehensive Model for R and D Project Portfolio Selection with Zero-One Linear Goal-Programming (RESEARCH NOTE)

Technology centered organizations must be able to identify promising new products or process improvements at an early stage so that the necessary resources can be allocated to those activities. It is essential to invest in targeted research and development (R and D) projects as opposed to a wide range of ideas so that resources can be focused on successful outcomes. The selection of the most ap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of multivariate analysis

دوره 100 9  شماره 

صفحات  -

تاریخ انتشار 2009